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Regularization of chattering phenomena via bounded 

variation controls 

Marco Caponigro* Roberta Ghezzi^ Benedetto Piccoli* Emmanuel Trelat^ 



Abstract 



In optimal control there may be bad oscillatory phenomena. For instance, this is the case 
for Fuller's phenomenon [5] where the optimal control is bang-bang with an infinite number of 
| switchings in finite time. In the framework of hybrid systems, there are optimization problems 

£S| ■ where the discrete part of the dynamics switches infinitely many times in finite time |13) (and this 

is known as Zcno's phenomenon.) 

In this paper we provide a technique to regularize this kind of phenomena. Namely, we define a 
BV regularization of a general optimal control problem and show that the solution of the perturbed 
problem is quasi-optimal for the reference problem in the sense that the loss of optimality is small. 
We apply our results to hybrid systems and we estimate the decay of the error, when the total 
. variation of the control grows. 

1 Introduction 

<N 

Chattering phenomena in optimal control have been known since the first example presented in [9]. 
■ Roughly speaking, chattering is a degeneracy phenomenon in which a control oscillates or switches 

t^-- | infinitely many times over a finite time interval. To explain this behavior, let us recall the example in 

[9], also known as Fuller's problem. Consider the control system 



c*v \ Xl X2 ' (i) 

{ x 2 = u, 

where an external agent acts on the system by modifying the acceleration, and the control u : [0, T] — > 
^ , [—1, 1] is measurable. The optimal control problem is to minimize the functional 

x\{t)dt (2) 



o 



among trajectories of ([I]) steering an initial point (x^x®) to the origin, i.e., satisfying initial and 
terminal constraints 

x 1 (0)=x° 1 , x 2 {0)=x° 2 , 

xi(T) = 0, x 2 (T) = 0. {> 
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It has been shown that for every (ij, x®) there exists a unique control u° : [0,T°] -> [-1, 1] which is a 
solution to dl]), ([2]), ([3]) and which has the following form 

(t \ = l h te[t 2k ,t 2k+ i],keN 
u{) 1 -l, te[t 2k+1 ,t 2k+2 ],keN 

where {tfcjfceN is an increasing sequence of switching times, depending on the initial condition (xj, x®) 
and converging to T° < oo. Although at first sight one could think this kind of degeneracy is related 
to specific symmetries of the system, it turns out that this behavior is rather typical. Indeed, it was 
later shown in the fundamental work |14| that the set of single-input optimal control problems which 
have control- affine Hamiltonian and whose solution is chattering is an open semi-algebraic set. 

Optimal controls showing this degenerate structure has been found for a variety of problems: 
besides the ones mentioned above, this phenomenon also concerns state constrained problems and 
hybrid systems. For instance, in [16j the author studied an optimal control problem with an inequality 
state constraint of third order and showed that the optimal trajectory touches the constraint boundary 
at an infinite sequence of times approaching a finite limit (Robbins' phenomenon). As it happens for 
Fuller's phenomenon, it turns out that Robbins' phenomenon is in a sense generic, when the order 
of the state constraint is sufficiently high, see In the framework of hybrid systems, chattering is 
also known as Zeno's phenomenon and it is related to the presence of trajectories whose discrete part 
jumps infinitely many times in a finite time interval (see the two examples in jllj). 

Chattering causes several difficulties in theoretical aspects of optimal control as well as in appli- 
cations. From a theoretical point of view it prevents a direct application of Pontryagin's Maximum 
Principle because of the lack of a positive length interval where the control function is continuous. This 
implies that, when chattering is included in the analysis, finding necessary and sufficient optimality 
conditions becomes more intricate (see [8] for the case of state constrained problems) . Some results in 
this sense were proved in |18j . yet the problem is not completely understood in many contexts, such 
as state constrained problems or hybrid systems [13]. Another delicate issue comes from the study of 
regularity properties of optimal syntheses [5| 115]. 

As concerns applications, chattering phenomena are often an obstacle when using numerical meth- 
ods to attack optimal control problems. For instance, for single-input problems with a scalar state 
constraint the presence of chattering may imply ill-posedness (non-invertible Jacobian) of shooting 
methods [3]. Likewise, as remarked by the authors in p], the interior point-based algorithm developed 
appears not to converge when applied to Fuller's problem. 

The main motivation of this paper is to provide convergence of numerical methods when chattering 
takes place. We consider a fully nonlinear control system and optimize a cost of Bolza type with 
continuous Lagrangian. We define a sequence of relaxed problems obtained by adding a term of 
total variation of the control in the cost functional. Thanks to small time local controllability, under 
equiboundedness of trajectories and some convexity properties of the set of velocities for the augmented 
system, we show the convergence of the cost functional along a sequence of solutions of the relaxed 
problems. 

We also study how fast the cost converges as the total variation of controls in the approximating 
sequence grows. Namely, we estimate explicitly the rate of convergence of the cost along suboptimal 
regimes obtained by suitable truncations of the chattering one in terms of switching times. A related 
result was proved in |19[ [20] for small perturbations of the Fuller's problem. In those papers, the 
authors exhibit a sequence of suboptimal regimes for a control system having ([1]) as principal part and 
for the cost ([2]) and they prove that the cost converges with the same rate as the sequence of switching 
times (of the chattering control). Here we obtain a similar result for nonlinear control systems with 
a general cost of integral type, only assuming small time local controllability at the final point and 
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holderianity of the time-optimal map. In particular, this allows us to sharpen the convergence rate of 
the cost along the sequence of solutions of the relaxed problem. 

Notice that for the case considered in [20] , the rate of convergence happens to be exponential as a 
function of the number of switchings. Likewise, for the class of systems considered in p3], the switching 
times converge exponentially to the final time. Nevertheless, whether a slower rate of convergence is 
"typical" still remains an open question. 

Finally, we apply our result to optimization problems for hybrid systems, obtaining estimates of 
the cost convergence as the number of switchings grows. 

The paper is organized as follows. In Section [2] we define chattering and state the main results. 
Applications of the main results to hybrid systems are given in Section [3l Section U] is devoted to the 
proofs of Theorems Q] and [2j Finally, we show in Appendix [5] an existence result for optimal control 
problems without convexity assumptions, using a total variation term in the cost. 

2 Main results 

Consider the control system 

x = f(x,u), uGU, (4) 
where / G C°°(R N x R m , R N ) and /(0, 0) = 0. Given U C R m , set 

U = {u : [0,t u ] — >■ U measurable, t u > 0}. 

Denote by 

T = {/(•, tt) : R N -»■ R N | u £ U} 

the family of vector fields associated with the dynamics of (|4|). Given an initial state xo, a Lagrangian 
L £ C°(R x R N x R m ), and e^Owe consider the optimal control problem 

min I / L(s,x(s),u(s))ds + eTV(u) 
u£U,T>0 \J 

with initial and final constraints 

x{0) = x , x{T) = 0, (6) 
where TV(u) denotes the total variation of u. 

Definition 1. By chattering control we mean a measurable u : [0,t u ] — > U such that there exists an 
increasing sequence {t n }neN converging to t u with the property that TV(u\iq^) < oo for every n, and 

lim TV(u\ [otn] ) = +oo. 

In the following result we consider the case in which problem ([5]), ©, with e = has an 
optimal solution which either has bounded total variation either is chattering in the sense above. 

Theorem 1. Assume that 

(i) Lieo-7 7 = ToIR^ and is small time locally controllable for the control system ([4]); 

(ii) the optimal control problem (J3J) , {[5]) , ([6j) with e = admits a unique solution u* : [0,T*] — > U 
whose corresponding trajectory is denoted by x* ; 
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(Hi) for every (t, x) € R x W N the set 



lim 

£->0 



K(t, x) = {(/(*, u), L(i, x, u) + 7) I u e U, 7 > 0} 

is convex; 

(iv) there exists b > suc/i t/iai, /or every u £ U whose corresponding trajectory satisfies ([6]), we 

^ a "I - 1 1 *^ia 1 1 00 ^ b 

and U is compact. 

Then, for every e > 0, the optimal control problem (HJ, ©, ©, admits a solution u £ : [0,T e ] — >■ U. 
Moreover, 

I * L(t,x £ (t),u £ (t))dt = I L(t,x*(t),u*(t))dt. (7) 

Theorem [TJ implies that whenever the optimal control u* is chattering, it can be replaced by a 
non-chattering one u £ , paying a price that goes to zero as the total variation of u £ grows. In other 
words, u £ is quasi-optimal for the problem with e = in the sense that the cost along u £ converges 
to the value function of the reference problem. To prove Theorem [TJ we use an auxiliary result (see 
Lemma [2] in Section which provides convergence of the cost along a sequence of controls obtained 
by truncating the optimal chattering control, without the convexity assumptions (Hi) or the a priori 
estimates (iv) on trajectories. 

Remark 1. According to Definition [TJ a control having one accumulation of switchings is chattering. 
When a solution of ([3]), © with e = 0, © presents a finite number of accumulations of switchings 
the problem can be tackled by applying iteratively Theorem [TJ 

The estimate on the error as the total variation grows can be sharpened. This is the statement 
of next theorem which gives the existence of suboptimal controls whose associated cost converges 
sufficiently fast to the optimal one. The rate of convergence is related to the regularity of the time- 
optimal map defined below. The idea is to consider the optimal control u* until a time T* — 77 and then 
steer the system to the origin with a control having total variation uniformly bounded with respect 
to rj. 

We define the time-optimal map associated with the problem (jTJ), © as 

T(a? ) = inf{T > | x = f(x,u),x(0) = x ,x(T) = 0}. 

Theorem 2. Assume that 

(i) Lieo T = ToIR^ and is small time locally controllable for the control system @ . 

(ii) The optimal control problem ([T]), ([5]), ([6]) with e = admits a unique solution u* : [0, T*] — >• U 
whose corresponding trajectory is denoted by x* . 

(Hi) The time-optimal map is C°' a for some a E (0, 1] on a neighborhood of 0. 

Then there exist t]q > and C > such that for any 7] < t]q there exists an admissible control 
v v : [0, T Tj ] — > U whose corresponding trajectory x^ : [0, TJ — > M, N starting at xq satisfies the terminal 
constraint x v (T v ) = 0, such that TV(vrj) < 00, and 

f n L(t,x v ,v v )dt- f L(t,x*,u*)dt^Cn a . (8) 
Jo Jo 
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Moreover, the following convergences hold 

lim T n = T* 

lim ||u„ — it* II ri =0 

lim \\x„ — x* ||oo = 0. 

J7->0 ' 

Remark 2. Sufficient conditions for Assumption (Hi) have been studied in [2]. Moreover the authors 
provide an estimate on the Holder exponent. 

The following corollary combines Theorems [T] and [2 It gives an estimate on the rate of convergence 
of the cost for the optimal control problem ([I]),©,© with e > to the optimal cost with e = in 
terms of the growth of the total variations of the optimal controls. 

Corollary 1. Let assumptions (i) — (iv) of Theorem^ hold. Assume moreover that 

(v) the time-optimal map T is C 0,a for some a € (0, 1] on a neighborhood of 0; 

Then, for every e > 0, the optimal control problem (jlj), (|5|). §6§, admits a solution u e : [0,T e ] — > U. 
Moreover, there exists M > such that 

T e i-T* 

L(t,x £ (t),u e (t))dt- / L(t,x*(t),u*(t))dt^M(fi a + e), (9) 
Jo 

where \i is such that 

TV(u*\ [0tT ,_^)^TV(u £ ). 

3 Application to hybrid systems 

In this section, we prove a direct adaptation of Theorem [1] to hybrid systems. To this aim, we first 
recall some basic definitions in the context of hybrid systems (see also [101 [13] ) . 
A hybrid system is a collection H = (Q, X, f, E, G, R) where 

• Q is a finite set; 

• X = {Xq} q( zQ is a collection of subsets X q C M> N called locations; 

• / = {f q }q£Q is a collection of smooth vector fields f q : M N — > ~R N ; 

• E C Q x Q is a subset of edges; 

• G is a set-valued map that associates with each edge (q, q') G E a subset G(q, q') C A g called 
guard set; 

• R is a set-valued map that associates with each pair ((g, q'),x) 6£xl,a subset q'),x) C 

A trajectory of % is a triple (r, </(•), where 

• r = {Tj}*£ is a sequence of increasing positive numbers such that tq = and M ^ oo. Set 
I = [0, t m ] if M < oo, I = [0, tm) if M = oo. 

• g : I — > Q is such that, for every i = 0, . . . M — 1, g(t) is constant on [Ti,Ti + i). Set qi = q\\ TitTi+1 ) 
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• for every i = 0, ... ,M — 1, Xj(-) = sc|( Ti)T ) is an absolutely continuous function in (Tj,Tj + i), 
continuously prolongable to [rj, Tj+i] and such that Xi(t) G X qi . 

• for almost every t G (r^Tj+i) 

h = f qi (xi). (10) 

• for every i = 0, . . . , M — 1 (qt, qi+i) G Xifa+i) G G(qi,qi + \) and, for every i = 0, . . . , M - 2, 
x i+ i(r» + i) G J?((gi,g +1 ),a;i(Ti + i)). 

A trajectory (r, g(-), is called Zeno if M = oc and Too < oo. Note that in this case we have 



oo 

= y^Tj < OO. 



' oo 

i=0 



Given a hybrid system a Lagrangian for % is a family L = {Lq} gg Q, L q :M x X q — > M. such that, 
for every trajectory (t,q(-),x(-)) of H and every i = 0, ...M — 1, the function i i-> L qi (t,Xi(t)) is 
continuous in (tj, ti+i) . Given a Lagrangian for 7~L, we can define the corresponding cost functional C 
by 

M-l . ti+1 

C(r,q(-),x(-)) =J2 L q .(t, Xi (t))dt 
i=0 Jti 

Casting Theorem Q] in the language of hybrid systems we obtain the following result. Let T~L = 
(Q, X, f, E, G, R) be a hybrid system with a Lagrangian L and corresponding cost functional C. Fix 
an initial condition (qo,xo) G Q x X qo . 

Theorem 3. Let % be a hybrid system such that X q = M. N for every q G Q and let L be a 
Lagrangian for H with corresponding cost functional C(-). Assume that (t* , q* (•) , x* (•)) is a Zeno 
trajectory starting at (qo,xo) and that 

C(r*,g*(0, **(•)) = , min C(r,q(-),x(-)) < oo, 

where the minimum is taken over all trajectories of % starting at (qo,xo). Let r* = {r*}°^ . Define 
the sequence of trajectories (r n , q n (-), x n (-)) by 

• Q n (t) = q*(t) for every t G [0,r*) ; q n (t) = q*(r*) for t G [r n ,r^]; 

• x n {t) = x*(t) for every t G [0, r*] and for almost every t G [t*,t^] x n satisfies 

x n {t)=f q * {<) {x n {t)). 

Then the following convergences hold 

IK(-)-x*(-)||oo = 0(r^-<) (11) 

C(r",g»(-) ja ; n (.)) -C(r*, ?*(■), = 0(r^-<). (12) 

The main idea is to interpret the role of the discrete part of the hybrid system in (|10p as a control. 
Since there are no final constraints the proof simplifies with respect to the proof of Theorems [TJ [2J 



lr The assumption X q — M. N is not essential and can be dropped, provided that each location X q is invariant with 
respect to the vector field /„/ for every q 
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Proof. Since q n (t) converges to q*(t) almost everywhere in [0, r^,], by [TTJ Theorem 1 p. 57] we deduce 
pip . As for (|12p . notice that since the Lagrangian is continuous, there exist constants c, c such that 
c — c > 0, for every n 

/ L q * (<) (t,X n (t))dt^c(T^-T*), 

and, for every i, 

L q *( T *)(t,x*(t)) ^ c, almost everywhere in [r*,r* +1 ]. 

Therefore, 

< C(r n ,q n (-),x n (-))-C(r*,q*(-),x*(-)) = / °° L q * {Tni) (t,x n (t))dt-J2 / * + L q * [t) (t,x*(t))dt 

J < i= n Jr t 

oo 

< ~<rlo-<)-cY,(r* +x -T*) = {c-c){r* 00 -r* n ). 

i=n 

This concludes the proof. □ 



4 Proofs of the main results 
4.1 Proof of Theorem CD 

Before going into technical details, let us outline the proof of Theorem [TJ First, we remark that 
the local controllability assumption provides existence of a solution u £ ,x £ to (|3J), ([S}, (|6|), for e > 0. 
Second, we show that, thanks to the assumptions on the velocity sets and the equiboundedness of 
trajectories, there exists an admissible control w and a positive measurable function 7 such that 
£(->■ (f(x e (t),u e (t)),L(t,x e (t),u e (t)) converges to 1 1-4 (f(x w (t), w(t)), L(t, x w (t), w(t)) + 7(f)) for the 
weak* topology in Third, we use the optimality of ii e to prove that w is optimal for the problem 
with e = 0, which by uniqueness implies w = u* . Fourth, we demonstrate 7 = which implies that 
the Lagrangian cost along u £ converges to the Lagrangian cost at u*. This fact is proved thanks 
to Lemma [2] which exhibits a sequence of admissible controls for which TV^i^) < 00 and the 
Lagrangian cost converges to the Lagrangian cost at u* . Finally, to construct v v we use a topological 
result, Lemma (H providing admissible controls steering a point to the origin with total variation 
uniformly bounded in a neighborhood of the origin. 

For the sake of readability, we start by presenting the two auxiliary lemmas mentioned above and 
then proceed to the proof of the theorem. 

Lemma 1. Assume that Lie 7" = T R N , the origin is small time locally controllable and let Tq > 0. 
Then there exist r, M > such that for every y with \y\ ^ r there exists a piecewise constant control 
Wy : [0, T y ] — > U with the following properties: 

(i) w y steers y to in time t v ^ To; 
(ii) sup \ y \ <r TV (wy) ^ M. 

Proof. Call A(x, [0, T), /) the set of points accessible from x in time t < T by trajectories of the 
control system x = f(x, u),u € U. Since is STLC for the control system (j4|) defined by /, thanks to 
\\2\ Theorem 5.3 a-d], it is also STLC for the system associated with — /. Therefore, for every T > 
there exists a ball B r (0) centered in the origin of radius r compactly contained in .4.(0, [0, T), — /). 
Since / is autonomous, for every y G B r (0), we have 

0eA(y,[0,T),f) yeA(0,[0,T),-f). 
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Since is STLC for — /, Theorem 5.3 in [T5| implies that is is small time normally self reachable 
(STNSR) (see \12\ Definition 3.6]) for the system associated with — /. Applying [12\ Theorem 5.5], 
since y G -4(0, [0, T), — /) and is STNSR for — / then y is normally reachable (see [121 Definition 3.6]) 
from in time less than T for the system associated with — /. Namely there exist q = q(y) G N, 
u\, . . . Ug G U and (ti, . . . , t q ) with t\ + ■ ■ ■ + t q ^ T such that the map 

(ai, . . . , s q ) H- E U ( S1 , ...,s q ) = 0o e -^/(^i) o • • • o e -*/(-.««), (13) 

which is defined and smooth in a neighborhood of (t\, . . . ,t q ), satisfies E u (t±, . . . ,t q ) = y and has 
maximal rank (equal to N) at (fx, . . . , t q ). Hence there exist a neighborhood O y C -4(0, [0, T), —f) of 
y on which the map E u is onto. Moreover, by continuity of the map (|13p . there exists c = c(y) such 
that 

sup { ai + ... + Sg | { Sl ,...,s g ) GMuE^ix)} ^c(y)T. 
xeOy 

In other words every point of O y is reachable from with piecewise constant control with less than 
q = q(y) switchings in time smaller than c(y)T via the system x = —f(x,u). Reversing time we have 
that for every point x in O y there exists a piecewise constant control with less than q = q(y) switchings 
steering system x = f(x,u) from x to in time smaller than c(y)T. 

By compactness, there exist yi, ■ ■ ■ ,y m such that B r (0) C UfcLi Vk and, for T sufficiently small, 
we can assume that TY^k=\ c{yk) < Tq. Hence, using the argument above, every point y G -B r (0) can 
be steered to in time smaller than T^2™ =1 c(yk) by means of a piecewise constant control with less 
than Ylk^iQiVk) switchings. □ 

Lemma 2. Assume that 

(i) Lieo-7 7 = ToIR^ and is small time locally controllable for the control system 

{ii) The optimal control problem ([5]), ([6]) with e = admits a unique solution u* : [0, T*] — > U 
whose corresponding trajectory is denoted by x* . 

Then there exists t]q > such that for any r\ < r]Q there exists an admissible control : [0, T„] — > 
U whose corresponding trajectory x r/ : [0, T„] — > M. N starting at xq satisfies the terminal constraint 
Xr) (T v ) = 0, such that TV(v v ) ^ TV(u* |[ ,t*-»j]) + M, and 

lim / L(t ,Xrj,Vr])dt — / L(t, x* , u*) dt. (14) 
Jo Jo 

Moreover, the following convergences hold 

1 ■ rr\ rj-r* 

lim J„ = 1 

ri^O ' 

lim llu- — u* \\ L i = 

■q^O 

lim \\Xr, — X* Uno = 0. 



Proof. Let r and M be given by Lemma [T] and let r/o be such that for every s ^ T* — rjo there holds 
|x*(s)| ^ r. For every n < t]q set t v = T* — rj. Let K = max[ 0j T*] \ f{ x *{t)-> u * (*))!> then 

= \x*(t v ) - x*(T*)\ ^ K(T* - t v ) = K V . 
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By Lemma[T]there exists a piecewise constant control steering x*(t^) to in time with TV{w rj ) ^ 
M. Define by 

'u*(t) for t G [0,^], 

v v (t) = < w v (t - Q for t G {t v ,T v ], (15) 

^0, fortE(T„,T], 

where T v = t rj + t v and T = sup ^ r?<r?0 T v . Denote by x v (t), t G [0,T] the trajectory of (jH) associated 
with the control v v . By construction 



Moreover 



ryK)<TF(n*| [0A] ) + M. 



lim T n = T* and lim Wvr, — tt*llri. 



r/^0 



Now set X = {x*(t) : t G [0, t v ]} U B m (0), C = sup^u |/|, Ci = sup^ oxU \d x f\, and C 2 
su Pa" xU |^u/I- For every i G [0, T] and rj < rjo one has 

- = / f(x r ,(s),v r ,(s))ds- f(x*(s),u*(s))ds 

Jo Jo 

< / r |/(x„( s ),^(s))-/(x*( s ),«*( s ))|d s + c |r*-r,| 

■/ 

< / T |/(^(«),«*(*)) - /(s*(*),u*(«))|<k + C 2 ||u* - ^|| L i + C \T* - T„| 
Jo 

<Ci / |x„(s) -x*(s)|ds + C 2 ||u*-^|| L i +Co|r*-r„|, 
jo 

thus, by Gronwall's inequality, 

IK - xloo SC (C 2 ||u* - Vr] \\ L i + C \T* - T v \) e Clf . 

In particular lim 7? _ i ,o \\ x ri — £*||oo = 0. 

Let us show the convergence (I14p . Notice that, by continuity of L, there exist constants c£l and 
C > such that 

L(t,x*(t),u*(t)) ^ c, almost everywhere in [0, T*], 



and 
Then 



\L(t,x,u)\ ^ C, for almost every (t,x,u) G [0, T] x ^ x U. 



0^/ L(t,x v (t),v v (t))dt - L(t,x*(t),u*(t))dt 



T* 



L(t,x v (t),v v (t))dt - 
^ Ct v — c(T* — t v ) = Cr rj — cq 

Therefore (|14p is proved. 

We are now in a position to prove the main result of the paper. 



L(t,x*(t),u*(t))dt 



(16) 
□ 
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Proof of Theorem^ Thanks to assumption (i), there exists a control u : [0, t u ] — > U steering xq to 
and having bounded variation. Therefore, the existence of a solution u £ : [0,t e ] — > U to (j4|), ©, ([6]), 
follows by Theorem 0] in the Appendix. Denoting by x £ the trajectory corresponding to u £ we have 
x e (0) = xo and x £ (t £ ) = 0. Let v £ be any admissible control such that TV(t>) < oo, and such that 
the corresponding trajectory satisfies ©. Then, by optimality of u £ we have 

/ L(t,x £ ,u £ )dt + eTV(u £ ) L(t, x v ,v) dt + eTV(v). (17) 

Jo Jo 

Thanks to assumptions (Hi) and (iu), and since U is compact, we are going to show that there exists 
an admissible control w : [0, t w \ — > U and a positive measurable function 7 : [0, t w ] — > IR such that the 
sequence of functions 

t ^ (f(x £ (t),u £ (t)),L(t,x £ (t),u £ (t))) 

converges to 

t ^ (f(x w (t),w(t)),L(t, x w {t),w(t)) + 7 (t)) 
with respect to the weak* topology of L°° as e tends to 0. To see this, consider the augmented system 

x = f(x, u), x N+ i = L(t, x, u), 

with constraints 

x(0) = x , X7v+i(0) = 0, x(T) = 0, X7v+i(T) € [0,T], 

where T = f Q tv L(s, x„(s), v(s)) ds+eTV (v) and v is as above. Denote by /(t, x, u) = (f(x,u),L(t,x,u)) 
the augmented dynamics. 

Notice that thanks to assumption 2, the sequence t £ is bounded and converges, up to subsequences, 
to to > as e tends to 0. Hence, given 5 > there exists Eq > such that \t e — to I < 5 for every e ^ £0 
in the chosen subsequence. Since /(0, 0) = 0, we can extend x £ ,u £ to [t s ,to + S] by setting 

x £ (s) =0,u £ (s) = 0, Vse [t e ,t + 5]. 

Since the trajectories x e are equibounded by assumption (iv), the sequence s 1— > f(s,x £ (s),u £ (s)) 
is bounded in L°°([0,to + 5] ; ^ Ar+1 )) whence, up to subsequences, it converges to a function g € 
L°°([0,to + f^M 7 ^ 1 ) in the weak* topology. Define 

x(t) = x Q + g(s)ds, x = (x ,0). 
Jo 

By construction, t H> x(t) is absolutely continuous and, denoting by x £ (t) = (x e (t), J Q L(s, x £ (s),u £ (s))ds), 

lim xJt) = x(t), Mt £ [0, t + 

£-i-0+ 

(Note that by Ascoli Arzela Theorem, since x £ is equibounded the above convergence is uniform, i.e., 
x £ converges to x uniformly on [0,to + 5].) Let x(t) = (x(t),xjv+i(t))- We are going to prove that 
x(t) is an admissible trajectory for (j3]), ([6]), corresponding to a control 10. First notice that by the 
pointwise convergence above, x(-) satisfies x(to) = 0. Define h £ (s) = f(s,x(s),u £ (s)). Then, for every 
e, h £ belongs to the set 

V = {h e L 2 {[0,t + 5],R N+1 ) I h(s) € V(s,x(s)) a.e. s € [0,t + 8]}. 

It is easy to see that V is closed in L 2 ([0,to + 5],M Ar+1 ) and, thanks to assumption (in), V is also closed 
with respect to the weak topology. Therefore, since h £ is equibounded in L 2 , up to subsequences, h £ 
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converges weakly in L 2 to h € V. By definition of V, this implies that for almost every s there exist 
w(s) G U and 7(5) ^ such that h(s) = (/(x(s), i»(s)), L(s, x(s), u>(s)) + 7(s)). 

We claim that fo(s) = g(s) for almost every s € [0,t + Indeed, for almost every s, since x £ 
converges uniformly to x 

\h £ (s) - f(s,x £ (s),u £ (s))\ < const sup |x(s) - x e (s)\ > 0. 

se[o,i +<5] 

Hence, by the dominated convergence theorem, 

h £ (s)- f{s,x £ (s),u £ {s))\ds ^0, Vi€[0,to + *]. (18) 



For all p S L 2 ([0, to + 5], R N+1 ) and for all t E [0, t + 6] we have 

<p(s)h £ (s) ds = (p(s)f(s,x £ (s),u £ (s))ds+ (p(s)(h £ (s) - f(s,x £ (s),u £ (s))) ds (19) 
Jo Jo 

The left hand side of (|19p converges to J* Q * ip(s)h(s) ds by the weak convergence of h £ in L 2 . The right 
hand side of (fTUj) converges to J* * ip(s)g(s) ds by the weak* convergence of s i-> /(s, x e (s), it £ (s)) in L°° 
and by (fTHj) . Hence, 

ip(s)h(s)ds= / tp(s)g(s)ds, \/ip € L 2 , 
■/ 

which gives h = g almost everywhere. The existence of measurable choices of s 1— > w(s), s 1— >■ 7(s) 
follows from Filippov Theorem see [6J Theorem 3.1.1] and, since h = g, we deduce that the trajectory 
corresponding to w is x. Hence in the following we denote x by x w and to by t w . 
For every admissible control v GU satisfying TV(v) < 00 and the constraints ([6]), 



L(t,x w (t),w(t))dt s$ / (L(t,x T0 (<),u;(t))+7(t))<« 

JO 

< limsupf/ L(t,x e (t),u e (t)) + eTF(u e )) 

e->0 \J0 / 



rtv 

^ / L(t,x v ,v)dt, 
Jo 

where the last inequality follows by (|17p . By density of BV in L 1 we conclude that for every control 
v € U steering xq to we have 



L(t,x w (t),w(t)) dt ^ / L(t,x v ,v)dt, 
Jo 

i.e., w is optimal for @, (0), (Jf)]) with e = 0. By uniqueness of u* we have w = u* and t^, = T*. To 
prove ©, it is sufficient to show that 

j(t)dt = 0. 



The chain of inequalities above allows to show that for every admissible control v £ U such that 
TV(v) < 00, we have 



Jo 



L(t,x*(t),u*(t))dt sC / L(t,x v ,v)dt-\\j\\ L i. (20) 
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For 77 small enough, let be the control built in Lemma [2J If TV{u*) < 00 or u* is chattering, then 
TV(v v ) < 00. Hence applying (f20|) to v = v v , we obtain that 

L(t,x*(t),u*(t))dt^ [ " L(t,x v ,v v )dt- || 7 || L i. 
Jo 

Finally, thanks to (|14|) . when 77 tend to we deduce ||7||£,i =0. □ 

4.2 Proof of Theorem [2] 

Comparing the statements of Theorems Q] and [2] one notices that in the latter there is no convexity 
assumption for velocity sets and there is no equiboundedness of trajectories required. This is due to 
the fact that the suboptimal controls are found by different methods: for Theorem Q] as the solution of 
a relaxed problem, for Theorem [2] as a truncation of the given control u* . Nevertheless, as it is evident 
from the previous section (see Lemma [2]) , convergence of the cost along truncations of u* was also a 
fundamental step in the proof of the first result. However, thanks to the regularity of the time-optimal 
map, in Theorem [2] not only quasi-optimality of the truncations is proved but also a convergence rate 
is provided. 

The proof of Theorem [2] mainly follows the outline of the proof of Lemma [2j The main difference is 
that in this case the extra assumption that the time-optimal map is Holder implies an explicit estimate 
on the rate of convergence of the costs. 

Proof of Theorem^ By assumption, there exist a ball -Br(O) such that T E C 0,a (Bji(0)). Let 770 be 
such that |x*(s)| ^ R for every s ^ T* —t]q. For every 77 ^ 770, apply Lemma[T]with To = 2T(x*(T*— 77)) 
to the point y = x*(T* — 7/). Denote by : [0,r^] — > U the piecewise constant control provided by 
Lemma [TJ Then 

t v ^ 2T(x*(T* - r/)) < 2C r \x*(T* - V )\ a ^ const rf*. 
TV{w v ) ^ M. 

Define v n by 

'u*(t) for t G [0, T* — 77], 

v v (t) = I w v (t - t v ) for t G (T* — 77, T v ], (21) 
,0 t > T v , 

where T v = T* — 77 + t v . 

Using the same argument as in the proof of Lemma [21 we obtain all the required convergences. 
Finally, thanks to (fT6|) . since t v ^ const 77°, ([8]) is proved. 

□ 

4.3 Proof of Corollary Q] 

Proof of Corollary m For every 77 > consider the control w v : [0, r„] —> U, whose existence is stated 
in Lemma [U steering x*(T* — 77) to in time such that TV(w„) M. Define by 



v v (t) 



u*(t) for t € [0, T* — 77] 

w v (t - (T* - 77)) for t € (T* - 77, T* — 77 + t v ] 



Let T rj = T* — i] + t v . Note that, by construction 

TV(v v ) < TV{u*\[o i T*-rj\) + M. 
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For e > 0, by Theorem [TJ there exists a control u £ solution of Therefore 

r 

Jo 

and by Theorem [2] 



L(t, x £ (t),u £ (t)) dt + £TV(u £ ) < / L(t, x v (t), v^t)) dt + e7V(u„), 



L(t,x v (t),v v (t))dt + £TV(v v ) ^ [ L{t,x*{t),u*{t))dt + Mr, a + £TV(v v ). 
o Jo 



Therefore 



T E t-T* 

L(t,x e (t),u e (t))dt- / L(t,x*(t),u*(t))dt ^ Mr] a + £(TV(v v ) -TV(u £ )) . (22) 

JO 

If TV(u*) is bounded then TV(v v ) < TF(«*) + M, whence e(TF(^) - Tl/(u £ ) < £{TV{u*) + M) 
and we are done. 

If TV(u*) = oo then TV(u £ ) — > oo as e — )• 0. Indeed, if TV(it e ) ^ C for every e > then there 
exists a BV function u such that This implies that 



lim / L(t,x £ ,u £ )dt = / L(t,Xu,u)dt. 



e->0+ 

Hence u is optimal for the problem with e = and by uniqueness, u = u*. On the other hand, by 
lower semicontinuity of TV(-), we have that 

TV(u*) < liminf TV{u £ ) ^ C, 

e->0 

which is a contradiction. 

Notice that the function 77 i— > TV(ii* Ip^T 1 "—^]) is non-decreasing. Hence for every e > let 77(e) be 
maximal such that TV(u*\^ 0tT *_ v ^) ^ TV(u £ ). 
With this choice of r\ we have 

TV(v v{£) ) - TV(u £ ) = TV{u%^_ n{£)] ) + TV(w v{£) ) - TV(u £ ) = TV{w v{e) ) ^ M. 
Thus (|22p with /j = 77(e) and taking the maximum between M and M we have ([9]). □ 

5 Appendix: an existence result 

Theorem 4. Consider the optimal control problem 

A = /(t,x,u), x(0)eMo, x(t„)€Mi (23) 
/ii(x(t)) >0,...,/i,(s(t)) ^0 Vt (24) 
u € W = {u : [0, i u ] — > U measurable, t u ^ 0} (25) 

min( [ L(s,x(s),u{s))ds + aTV{u)] , (26) 
new VJo ' / 

where f : R X R-^ x U — > R is measurable w.r.t. t, locally Lipschitz w.r.t. x, hi, . . .h\ G C°(M Ar ), 
L G C°(R x R-^ x R m ), U C R m is compact, M$,M\ are compact submanifolds, and a > 0. Assume 
that 
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(i) Va; € M° U M 1 , h x {x) > 0, . . . fy(x) ^ 0; 

t/iere exists u € £•/ having bounded variation, steering Mq to Mi , and such that the corresponding 
trajectory satisfies the constraints ()24|) ; 

(^m) there exists b > suc/i i/iai, /or every u ZLlA and every trajectory x u of (|23|) corresponding to u, 
we have 

tu ||^u||oo ^ b. 

Then the optimal control problem (|2"5|) , ([24"]) . (|2o]) admits a solution. 
Proof. Let 

(5 = inf L(s, x(s), u(s)) o!s + aTV(u)^j , 

where the infimum is taken among all controls u € IA steering Mq to M\ and whose corresponding 
trajectory satisfies (|24p . Let ti n : [0,T n ] -^Ubea minimizing sequence of admissible controls, i.e., 

lim ( / L(s,x n (s),u n (s))ds + aTV(u n ) ) = 5, 

where x n is a trajectory corresponding to u n . Then, thanks to assumption (in), for n sufficiently large 
we have that 

aTV(u n ) ^ / L(s,Xu{s),u(s)) ds + aTV(u) + C, 
Jo 

for some constant C ^ 0, which implies that u n is a bounded sequence in BV. Then by a standard 
compactness result in BV there exists a subsequence, still denoted by u n , and a control u a such 
that 

lim | \u n — u a \ \ L i = 0. 

Up to a subsequence, let u n converge to u a almost everywhere, T n converge to t a and x n (0) converge 
to x° a . We are going to prove that u a : [0, t a ] ->• U is a solution to (EHJ) , (JMI, ([IS]). By pU Theorem 1 
p. 56], the convergence almost everywhere of u n to u a implies that if x a satisfies 

X a — f (t, X a ,U a ) , X a (0) — x a 

then x n converges uniformly to x a . Hence x a satisfies the constraints (}24"|) and, by compactness of Mq 
and Mi, we obtain that x a (t a ) € M\. Hence u a is admissible. Moreover, 

lim L(t,x n (t),u n (t)) = L(t, x a (t) , u a (t)) , for almost every t. 
Hence, thanks to assumption (Hi), the dominated convergence theorem allows to conclude that 

lim / L(t,x n (t),u n (t))dt= / L(t,x a (t),u a (t))dt (27) 



'"^Jo Jo 
On the other hand, by lower semicontinuity of TV(-) we have 

TV(u a ) ^ lim inf TV(u n ). (28) 

n— >oo 

Using ()27|) . (|28p and since u n is a minimizing sequence, we infer that 

L(t,x a (t),u a (t))dt + aTV(u a ) sC 5 

o 

which implies that u a is optimal. □ 

In contrast with usual existence results see [7], we do not assume convexity of the dynamics. The 
conclusion is assured by the term of the total variation in the cost functional. 
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